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Abstract 

In this paper we view the steady states of classical random walks over complex networks with 
an arbitrary degree distribution as states in thermal equilibrium. By identifying the distribution 
of states as a canonical ensemble, we are able to define the temperature and the Hamiltonian for 
the random walk systems. We then calculate the Helmholtz free energy, the average energy, and 
the entropy for the thermal equilibrium states. The results shows equipartition of energy for the 
average energy. The entropy is found to consist of two parts. The first part decreases as the 
number of walkers increases. The second part of the entropy depends solely on the topology of 
the network, and will increase when more edges or nodes are added to the network. We compare 
the topological part of entropy with some of the network descriptors and find that the topological 
entropy could be used as a measure of network complexity. In addition, we discuss the scenario 
that a walker has a prior probability of resting on the same node at the next time step, and find 
that the effect of the prior resting probabilities is equivalent to increasing the degree for every node 
in the network. 
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I. INTRODUCTION 


Complex networks describe a variety of systems of importance such as the Internet, 
the World Wide Web, the cell, and social networks M- For example, the Internet is a 
complex network that consists of billions of devices that are connected by physical links 
|4]; Social networks are made up of humans or organizations that are linked by various 
social relations [5j; The World Wide Web is a network with large amounts of web pages 
being connected by hyperlinks. Traditionally large scaled networks have been described 
as random graphs by Erdos-Renyi model which have every pair of nodes connected with 
probability p such that edges are distributed randomly HE]. In recent years the increased 
computing power and the computerization of data acquisition allow scientists to investigate 
real large scaled networks and End deviations of the topologies of real large networks from 
a random graph MD]. In most real networks the clustering coefficient is much larger 
than it is in comparable random graphs. The distribution of node degrees in many real 
networks significantly deviated from a Poisson distribution that is predicted by a random 
graph. In fact, the World Wide Web and the Internet have a power-law tail in their degree 
distribution thus are scale-free networks m • New models and methods are thus needed to 
understand the topology, the underlying organizing principles, and various dynamics that 
take place on networks HOT. For example, statistical mechanics offers a framework for 
describing the topology and the evolution of these network systems, and also provides tools 
and measurements to quantitatively depict these organizing principles [13]. 

Random walks provides an explanation for many stochastic processes in various fields 
such as chemistry, computer science, physics, and ecology E3- Usually, random walks 
are assumed to be Markov processes [16] . A random walk in discrete time steps on a 
complex network is a special case of a Markov chain and can be viewed as a generalization 
of Drunkard’s walk S3- When a walker is on node a of the network the walker picks the 
available edges that are linked to the node with equal probability. Thus, if node a has 
K a edges the walker will go to each one with probability 1/K a at the next time step. We 
can ask the questions like what is the average time for the walker to return to its starting 
node? What is the average time for the walker to reach another node b on the network? A 
quantity called the Mean First Passage Time (MFPT) gives answers to the questions p~8j . 
The MFPT ( T aa ) for a walker on node a to return to the same node is K a /K, depending 
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only on the degree K a and the total number of degree K of the network. Studies on MFPT 
between two different nodes a and b show asymmetry between (T ab ) and ( T ba ). Often the 
asymmetry between the MFPT has explicit degree dependence. The results can be used to 
estimate the size of large networks and provide a measure for effectiveness in communication 
between nodes. 

Random walks over complex networks also possess another intriguing property. A random 
walk is ergodic and after a long time the probability P a of finding a walker on node a is 
solely determined by the degree K a of the node and the total degree Ii. The probability 
distribution of P a always reaches a steady distribution, independent of the initial position for 
the walker [T9]. The process is similar to the processes in thermal systems that go from non¬ 
equilibrium states to equilibrium states. Independent of various initial conditions, a thermal 
system could reach the same equilibrium state in certain conditions after a relatively long 
time. For example, the free expansion of ideal gas and the diffusion of ink drops in a glass 
of water. It thus suggests a random walk on network to be viewed as a thermal system. In 
classical thermodynamics the state of a thermal system is defined as a condition uniquely 
specified by a set of properties m- Here we use the probability distribution {P a } to specify 
the states of a random walk on network. The topological factors of a network such as the 
number of nodes M , the degree distribution {K a }, and the distribution of edges linking 
the nodes are viewed as the parameters that specify the phase space of a random walk 
system. Therefore, an equilibrium state in the systems of random walk over networks is one 
in which the probability distribution {P a } does not change with time unless the system is 
acted upon by external influences. A non-equilibrium state has its probability distribution 
{P a (t)} vary with time. A stochastic process in the random walk system is thus a path that 
consists of a series of states through which the system passes. In statistical thermodynamics 
a thermodynamic system is regarded as an assembly of enormous number of ever-changing 
microstates. The basic assumption of statistical thermodynamics is that all microstates 
of an assembly are equally probable. The thermal equilibrium state is defined as the most 
probable state that has the largest number of corresponding microstates in a thermodynamic 


system. In section[ITT]we find the probability of the distribution of N non-interacting random 
walkers {n a } that gives the steady state as the most probable state in the thermodynamic 
system. 

In this paper we consider an arbitrary finite network which consists of nodes a = 
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1,2with undirected edges. The network is assumed to be connected, i.e, there 
is at least a path between each pair of nodes (a, b). The connectivity of the network is 
represented by the off-diagonal adjacency matrix A with its elements A a f, to be either 1 (if 
(a, b) are directly connected) or 0 (if (a, b) are not directly connected). The degree K a of 
node a is defined to be the total number of edges that link directly to node a from other 
nodes in the network. A classical walker moving on the network is stochastic and can be 
described by the master equation 

|P(( + 1))=^B- 1 |P(«)). (1) 


Here A is the adjacency matrix, D denotes a M xM diagonal matrix with diagonal elements 
D aa = K a , and | P(t)) is a M x 1 column vector whose element P a (t) is the probability of 
finding a walker on node a at time step t. Independent of the initial conditions |P(f)) is 
shown to approach to a steady probability distribution 17r) at large time t —y oo with 


7T a 



( 2 ) 


where 7r a is the element of 17r), and K = J2 a K a is the sum of all degrees. From the result in 
Q the average return time ( T aa ) is easily derived as follows. Consider a walker that moves 
on the network from time step to to to + At (At 1). Initially at time step to the walker is 
on node a. During the time interval the expected number of finding the walker on node a is 
7r a At, and thus the average return time for node a is found by ( T aa ) = At/(7r a At) = K/K a . 
Other interesting quantities about the network such as the mean first passage time (TA) 
and the random walk centrality can also be found by studying the master equation in (|T|) 

m- 

Although the master equation provides detailed information about the motion of a walker 
on the network, the fact that the probability distribution | P(t)) finally approaches to a 
steady state 17r) does suggest that 17r) can be thought of as a thermal equilibrium state in a 
thermal system. For a random walk on network with N non-interacting walkers (N 3> 1), 
the system should have a “temperature” T when it reaches its thermal equilibrium. Thus 
(N, M, K, {K a },T) are viewed as state variables for the random walk thermal system. In 
section [El] we use the result in ([2]) to help us define the so-called “temperature” T for the 
steady states of classical random walks on networks. We then calculate various state func¬ 
tions such as the internal energy, the entropy, and the Helmholtz free energy for the steady 
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state (viewed as a thermal equilibrium state) in section III We find that the topological part 
of the entropy can be used as a measure of network complexity. In section |IV| we discuss 
a modified random walk model that has prior resting probabilities for a walker to stay on 
nodes at the next time step. In the last section we compare the topological entropy to other 
network descriptors and give our conclusion. 


II. STEADY STATES AS THERMAL EQUILIBRIUM STATES 

Instead of considering a single walker on a network, we assume N ( N 1) mutually 
non-interacting classical walkers that move on a connected complex network with M nodes. 
The move of walkers is ergodic and will reach an equilibrium after a substantial long period 
of time. It strongly suggests that we view the steady state as a thermal state in equilibrium. 
As shown in FigjlJ any undirected edge of the network can be viewed as consisting of two 
directed links. Each node in the network has equal numbers of outgoing and incoming links 
attached to it. In the steady state, the distribution of walkers {n a } does not change in time. 
This is only possible when all the directed links, outgoing and incoming, have the same 
flux of walkers at each time step. It is easily checked that the steady state has the walker 
distribution {n a = NK a /K} or equivalently, the probability distribution {P a = K a /K}. 

In classical thermodynamics the concept of temperature is a property of a system that 
determines if thermal equilibrium exists with some other system. Once the steady state 
of random walks is viewed as a thermal equilibrium state then it is possible to define the 
“temperature” for the steady state. To find the temperature let’s consider the contact of two 
random walk systems, the network 1 of Ni walkers with M\ nodes and the network 2 of N 2 
walkers with M 2 nodes. Both random walk systems are assumed to be in their steady states. 
So the number of walkers on node a in the network 1 is = NiK^/K^ l \ where K^ ! is 

the degree of node a and K F) is the total degree of the network 1. Similarly, the number of 
walkers on node b in the network 2 is n[ 2 ^ = N 2 K^/K^ with being the degree of node 
b and K 1 ' 2 ' 1 being the total degree of the network 2. We then bring the two systems together 
and join them by linking A Mi nodes of the network 1 to A M 2 nodes of the network 2 as 
shown in Figure [2] The number of linking edges l between the two networks is assumed 
to be much smaller than A'F) and K^ 2 \ Under the small ^ assumption the total degree of 
the combined network can be thought of as the sum of the degrees of the two networks. 
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The combined network now has IV = IVi + IV 2 walkers moving on it with M = Mi + M- 2 
nodes and the total degree K = A^ 1 ) + A^ 2 ). When the distribution of walkers reaches an 
equilibrium in the combined system , the number of walkers on node a in the network 
1 and the the number of walkers nl ’ on node b in the network 2 are 


(1) _ ^1+^2 A qi) 

KW + AT( 2 ) ° ’ 

(2) _ Ni + N 2 (2) 

b KW + AT( 2 ) 6 ' 

The differences (h^ — n W) and ( n^ — n^) are thus found as 


hi) _ Nl N 2 KM KW (1) 

KWK { W - ’ a ’ 


No 


:( 2 ) 


( 2 ) _ N ± N 2 k® km 


Ub Ub RM)K y No 




)Ki 


( 2 ) 


( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 


From the results in Eq.s(5]6) the net flow of walkers will go from network 2 to network 1 if 
K^/Ni > K^/N 2 is satisfied. When K^/Ni < K^/N 2 the net flow of walkers will go 
from network 1 to network 2. There is no net flow of walkers between the two networks if the 
ratio A^d/TVi equals A^ 2 ) /N 2 . It thus suggests that the ratio K/N could be the candidate 
of “temperature” for the steady state of the random walk on a network of N walkers with 
total degree K. It is interesting to note that, with the ratio K/N being identified as the 
temperature for a random walk in equilibrium, the final temperature after combining two 
equilibrium random walks is always between the two temperatures of the two random walks 
before combining them. In the next section more reasons will be provided for identifying 
K/N as the temperature. 


III. CANONICAL DISTRIBUTION FOR CLASSICAL RANDOM WALKS ON 
NETWORKS 

The probability of obtaining the distribution of walkers {n a } over nodes is found as 

P(.{n a };N,{K}) = N (7) 

The result in (JT]) is obtained by using the probability distribution {P a = K a /K} for a single 
walker and then calculate the probability of having the walker distribution {n a }. In the 
large N limit, it has 

N f 

In P({n a };N, {K a }) = -- £ & In(f-) + 0(ln N). (8) 

K a K 0 , 
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Here £ a = n a K/N ranges from 0 to K. The steady state 17r) corresponds to the distribution 
of £ a , {£ a = K a }. From ([8]), we find the relative frequency hi for the distribution {£ a } 

Si(fe}, {K.}) = exp[-^tf ({£,}; {*.})], (9) 

with 

#({«;{«■„}) = (io) 

a 

The relative frequency resembles the canonical distribution for a thermal system with 
Hamiltonian H({C, a }] {K a }) that is in equilibrium with a heat bath at temperature T = 
K/N. The Hamiltonian 77({£ a }; {K a }) is easily proved to be non-negative and has the 
minimum H min = 0 which corresponds to the steady state, i.e., {£ a = K a }. The temperature 
is identified as T = K/N since when j3 = 1/T increases the expectation value of (H) will 
decrease and the system is likely to lie in the low-energy states. 


A. Partition function, Helmholtz free energy, and the entropy 


The partition function for the random walk system is calculated as follows. Consider the 
expansion of Hamiltonian 77({£ a }; {K a }) 

M (f — K ^ 

//({«; {/U}) = £ + 0(A? 3 )- (11) 

0=1 

Here and (£ a — K a ) are of the same order of magnitude. The partition function Q M for 
the thermal system is defined as 


Qm=J2 exp({£„}; {K a })}. (12) 

Dehne new variables Xa = Ha ~ K a ) so that J2 a Xa = 0 is satisfied. We then calculate Qm 
and get 


M M a 

Qm — {YldXa}S{J2Xa)exp[-J2' 

J o=l a 


o=i ^Aa 


,27T n 

'J' 


M 


(M-l)/2 K -l/2 K l/2' 


(13) 


a =1 


From the partition function Qm in (13), we find the average “energy” for the random walk 
system in equilibrium 


(H) 


d In Qm = (M - 1 )K = (M - 1 ) 
d/3 2 N 2 


(14) 
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The result in (14) is exactly the equipartition of energy in thermodynamics. The total degree 
of freedom for the random walk system is (M — 1) as can be easily seen from the number 
of independent variables in the Hamiltonian H({£ a }] {K a }), and each degree of freedom 
acquires the same average energy T/2. 

The Helmholtz free energy F for the random walk system is found by 


M 


F = In Q m — ~{(M - 1) ln(27rT) - In K + £ In K a }. 


a=1 


From the Helmholtz free energy we obtain the entropy S for the system 


M 


S = - % = [ln(27rT) + 1] - \ In K + \ £ In K„. 


(15) 


(16) 


Obviously, the entropy S in (16) does satisfy the relation dS/d{H) = 1/T. All state functions 


in the equations (14^16) depend (at least partially) on the state variables M,T,K, and 
{Ka}- 

The entropy S consists of two parts 


M - 1 

S= -—— hi N + S t0 p, (17) 

M — 1 M — 2 l M 

Stop = —^— ln ( 2?re ) d- 2 — hl K + 2 ^ ln Ka ’ ^ 18 ' > 

where S top denotes the part solely determined by the topology of the complex network. In 
general, without changing the network topology the entropy S decreases as the total number 
of walkers N increases. On the other hand Stop seems like to be an interesting property of 
the complex network. First, the topological entropy S top will increase by adding more edges 
to the network. For example, an edge is added to the network between node a and node b. 
As a result, both the degree K a and K b are increased by 1, the total degree K is increased 
by 2, and the topological entropy is changed by A S top 

AS top = —-— ln(l + —) + -{ln(l + + hi(l + —)} > 0. (19) 

Second, adding more nodes to the network also increases S top . Suppose a new node is 
connected to nodes 1,2of the network. In the situation the number of nodes is 
increased by 1 , all the degrees of nodes 1,2, ... ,q are increased by 1 , and the total degree is 
increased by 2 q. Thus the change in S top is 

A Stop = ~ —^— ln(l + jw) + 2 [l n (27T J K') + 1] + - ^2 ln(l + —) + -Inq 


> 0 . 


( 20 ) 













From the results in (19) and (20) we conclude that the topological entropy S top always 
increases as more nodes or edges are added to complex networks. It strongly suggests that 
the topological entropy S top could be used as a measure of network complexity. Among 
all connected networks with M nodes, the star topology (e.g. the network A in Fig|3] for 
M = 5) has the minimum value of Stop 


S 


(min) 

top 


(M) = - 1)] + 1} + 


( 21 ) 


while the fully connected topology (e.g. the network L in Fig|3]for M = 5) has the maximum 
value of Stop 

= ^l{ln[2x(M - l) 2 ] + 1} + Kll In M. (22) 

As an example, let’s consider four networks A, F, K and L as shown in Figure [3} All the 
four networks have five nodes but are of different topologies. Among the four networks the 
star topology (network A) has the minimum value of Stop while the fully connected topology 
(network L) has the maximum Stop■ All nodes in the network A have just one edge linked to 
them except for a central node that is connected to all other nodes. Contrarily, the network 
L has no central node. Any two nodes are linked by an edge in the network L. It has 
Stop = 13.635 > S^ op = 12.725 > Sf op = 10.863 > Sf op = 9.488. The above result suggests 
that the network A is less complex than the network F, the network F is less complex than 
the network K, and the network K is less complex than the network L. 


IV. RANDOM WALK WITH A PROBABILITY OF RESTING ON NODES 

In the section we consider a slightly different scenario: a walker has a prior probability 
cr a of resting on node a at each time step. The scenario is indeed related to the random 
walks on the connected networks that have self-linked edges linking a node to itself. As an 
example let’s consider a connected network with self-linked edges as shown in Figure |4} A 
self-linked edge is attached to node a and two self-linked edges are attached to node c. Since 
each self-linked edge contributes the amount of 2 to the degree, we thus find the degrees 
of the nodes to be {K a = 4, Kb = 2, K c = 6, Kj = 2} and the total degree of the network 
is K = 14. For a walker on node c at time step t, it has the probability 1/6 of hopping 
to node b or d at the next time step. In other words, it has the probability 2/3 of resting 
on node c at time step t + 1. Similarly, a walker on node a at time step t could stay on 


9 








the same node with the probability 2/4 at the next time step. From the distribution of 
degrees we thus find the probabilities of finding a walker on the nodes in the steady state, 
(A = 4/14, A = 2/14, P c = 6/14, P d = 2/14}. 

The discussion is easily generalized to the case of TV non-interacting random walkers 
moving on a connected network of M nodes with the degree distribution { K a } and the 
distribution of prior resting probability on nodes {cr a }. Once again, here <r a denotes the 
probability of resting on node a at the next time step for a single walker. The case is equiv¬ 
alent to a conventional random walk on a network of M nodes with the degree distribution 

{Kh 


K = K a + 2 A = 


K n 


(1 -*«)’ 


I = aa 

“ 2(1 - CT a ) 


K n 


(23) 

(24) 


where L a denotes the number of the self-linked edges attached to node a. In general, L a 
ranges from zero to infinity and may not be an integer. We thus find the temperature T of 
the modified random walk model as 

K' 

r = v (25) 

with the effective total degree K' 


k' = y Ka . 

a (1 - Oa) 


The probability of having the walker distribution {n a } is thus found by 

1 K 

1 r ^ a 1 n a 

nj (1 — (Taj 


TV' 1 K 

P(M; N, {Ka}, {<T„}) = jpjj n ' 


(26) 


(27) 


The probability distribution P({n a }; TV, {K a }, {cr a }) depends not only on the topology of the 
network but also on the distribution of the node-resting probability {cr a }- In fact, it is better 
to view {a a } as one of the topological factors of the network. Other state functions such as 
the average energy, the Helmholtz free energy, and the entropy of the modified random walk 
system are obtained by replacing K a with K' a , and replacing K with K' in the equations 
( JTdj [TTj| [l6| . For example, the topological part of the entropy in the modified random walk 
model is 

Stop = —g— ln ( 27re ) + ~2 — ln K ' + 2 £ ln K '«- ( 28 ) 
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In conclusion, the effect of resting on nodes is equivalent to increasing the degree at every 
node in the network. The topological entropy also increases in the modified random walk as 
compared to that in the conventional random walks without the prior resting probability. 


V. DISCUSSION 

In this paper we consider N non-interacting classical walkers that move on a connected 
network with M nodes and the distribution of degrees {K a \. We found that the steady 
state of the random walk system can be interpreted as a thermal state in equilibrium. We 
then identified the temperature for the system to be K/N , where K is the total degree, 
and calculated various state functions of the system such as the average energy (H), the 
Helmholtz free energy, and the entropy S. The results show that the equilibrium thermal 
state has the property of equipartition of energy. Without changing the network topology 
the entropy S increases as the total number of walkers decreases. In fact, the entropy S 
consists of two parts. The first part depends solely on the number of walkers N and the 
number of nodes M. The second part, called the topological entropy S top in the paper, 
depends solely on the network topology. In general, S top will increase as more edges or 
nodes are added to the network. It thus suggests that S top can be used as a measure of 
network complexity. 

There is no absolute definition of what complexity of networks means. Even without a 
precise definition of network complexity, a few descriptors are used as measures of network 
structure [2UE2]. For example, the information theoretic index for node degree distribution 

M 

I vc = K log 2 K-Y, K a log 2 K a , (29) 

( 2=1 

which makes use of Shannon’s formula for the total information content of the vertex distri¬ 
bution. Usually I vc increases with the connectivity and other complexity factor such as the 
number of cycles. As shown in Fig. [3j the total degree K increases from K = 8 to K = 20 
for the five-node graphs from A to L. The plot of the normalized index 

4 C = — t~—^\ i (30) 

UL - j[rnax) 7 v ' 

-Lvc 

is shown in Fig. 0 with l\™ ,ax ' ) being the maximum of I vc for the five-node networks. It 
shows that I vc does increase with connectivity. The number of cycles is also considered as 
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a stronger network complexity factor, and this is seen in the sequence of graphs with one 
to five cycles: F —>■ H —» J —!>■ K —> L. Another useful descriptor that describes the overall 
degree of network clustering is the average clustering coefficient (C) 

g gCa 

M ’ 

2 fa 


(C) = 

c n = 


(31) 

(32) 


K a (K a -iy 

Here f a denotes the number of edges that link the first neighbors of node a, and C a is the 
clustering coefficient of node a. As seen in Fig. [3j the clustering coefficients of all nodes in 
the acyclic graphs A, B, and C, are zero. All vertices in cyclic graphs having four or more 
vertices also have zero clustering coefficients. Nonzero clustering coefficients can only be 
obtained in tri-membered cycles. More tri-membered cycles in a graph usually give a higher 
( C) value, as can be seen in the sequence of graphs with one to three tri-membered cycles: 
D —> G —>■ J. Other descriptors such as the B 2 and B 3 indices make use of vertex and vertex 
distance distribution 

M K 

32 = E TT. < 33 ) 


a=l Da 

M K K 

B3 = B2 log 2 B2~Y,-^ log 2 -jp 


o=l Da 


(34) 


Here D a is the sum of all the minimum distances D a i between node a and other nodes (node 

b) 

Da = Y, Dab. (35) 

b^a 

The indices B2 and B 3 increase with the connectivity and the number of cycles, similar to 

the index I vc . However, unlike the index I vc , B2 also increases with the appearance of an 

additional branch. This is seen in the plot of the normalized B2 index in Fig. [5] for the 

sequences of graphs of the same total degree: C—)-B —> A, F—)-E—)-D, H —> G, J—* I. 

In order to compare the topological entropy S top and the other network descriptors I vc , ( C ) 

and B 2, the descriptors are normalized and thus range from 0 to 1. For example, the graph 

L in Fig. [3] has the maximum value of I vc = 46.439 among all the five-vertex graphs. We 

thus define the normalized index I vc = J„ c /46.439. Similarly, the normalized B2 index is 

defined as B? = /A/S.O for all the five-vertex graphs in Fig. [ 3 } The normalized topological 

entropy S top for all M- node networks is defined as 

"1 

^top 


Stop — 


Star, - ln(27re) 


{Stop -2 1 ln(27re)} r 


(36) 
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The term (M — 1) ln(27re)/2 is subtracted from S top in equation (36) since it is the same 
for all M -node connected networks. A plot of S top , Ivc, (C), and B 2 for all the five-node 
networks in Fig. [ 3 ] is given in Fig. [HJ As easily seen from Fig. [HJ S top and I vc behave 
similarly as descriptors of network structure. Both of them increase with the connectivity 
and the number of cycles. Contrary to the B 2 index, St op will decrease with the appearance 
of an additional branch. Thus the network with star topology will have the minimum value 
of S top , while the network of a chain of nodes will have the minimum value of B 2. All 
the descriptors in Fig. [5] will reach their maximum values when the network has a fully 
connected topology. 

Although Stop behaves like I vc for the five-vertex networks in Fig. [3j they are different in 
a few aspects. First, the topological entropy S top depends explicitly on the total number of 
M while I vc depends on M implicitly. This originates from the fact that the index I vc makes 
use of Shannon’s formula for the total information content of the vertex distribution, while 
the topological entropy S top makes use of the distribution of classical random walkers over 
nodes. Second, the order of magnitude of S top for any M -vertex networks is 0(M log M). 
On the other hand, the order of magnitude of I vc is 0(M log M) for the network of a star-like 
topology, and is 0(M 2 log M) for the network of a fully connected topology. It means that 
I vc increases much more rapidly than St op when more and more edges are appearing between 
nodes. This is shown in Fig. [6] for 100-vertex networks. For any networks with fixed number 
of nodes M and fixed total degree K, the indices I VC (M, K;{K a }) and S top (M, K ; {K a }) 
descriptors are viewed as functions of the node distribution {K a }. Theoretically, there are 
upper bounds for I VC (M, K; { K a }) and S top (M, K\ {K a }) with fixed values of M and K 


I V c(M , K] {K a }) < I VC (M, K; {K a = K/M }) = K log 2 M, (37) 

S top {M , K- {K a }) < S top (M , K ; {K a = K/M}) 

M - 1 M 

= ———[ln(27r) + 1]-—In M + (M — 1) In K. (38) 

The results in the above equations show that the upper bound of I vc increases linearly with 
K for M -node networks, and the upper bound of S top is a linear function of log K for M-node 


networks. After truncating the A'-independent terms in (38), the truncated upper bound of 
S top is normalized as 


S" = 


In K 


top ln(M(M — 1)) 


(39) 
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The normalized upper bound I' vc is defined as 


J' = 


K 


(40) 


Fig. [6] shows the normalized upper bounds I' vc and S' top for M = 100. The total degree 
K ranges from 2(M — 1) to M(M — 1) in Eq.(40) and (39). When M is large, S' top goes 
approximately from 0.5 to 1.0 as K increases, different to the upper bound I' vc that goes 
approximately from zero to 1.0. For most of the values of K , S' top is a slow-changing function 
of K as compared to I' vc . 
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FIG. 1: Any undirected edge in the network can be viewed as consisting of an outgoing edge and 
an incoming edge in the random walk models. 



FIG. 2: The network 1 with M\ nodes and total degree K ^ is linked to the network 2 with M -2 
nodes and total degree K . The number of linking edges i between the network 1 and 2 is assumed 
to be small such that t « K^ l \ is satisfied. In the figure only the linked nodes between the 
two networks are shown. 
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FIG. 3: The topological entropies for the five-node networks are Sg p = 9.488, Sg p = 9.691, Sg p = 
9.835, Sgp = 10.575, S t % = 10.719, S[ op = 10.863, Sg p = 11.339, S t % = 11.541, S J top = 11.975, 
Stop = 12.119, Sg p = 12.725, and Sg p = 13.635. The numbers in the figure denote the average 


clustering coefficient ( C) for each network. 



FIG. 4: A network with self-linked edges. Node a has one self-linked edge and node c has two 


self-linked edges. 
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FIG. 5: Plots of network descriptors Stop, Ivc, B2, and (C) for the five-vertex graphs listed in Fig. 
[ 3 } The descriptors Stop, Ivc, and B2 are the normalized indices of Stop, Ivc, and B2. 



FIG. 6: The normalized upper bounds S' top (truncated) and I' vc for 100-vertex networks as functions 
of the total degree K. 
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